CDS

Accession Number TCMCG067C51718
gbkey CDS
Protein Id KAF8045414.1
Location complement(join(174..360,531..704,888..990,1083..1238,1544..1652,1732..1925,3525..3645,3756..3843,4265..4334,4427..4547,4671..4838,5182..5367,5791..5877,5971..6113,6216..6387,6473..6568,6651..6733,7019..7136,7226..7363,7440..7508,8109..8351))
Organism Sinapis alba
locus_tag N665_4968s0001

Protein

Length 941aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA214277, BioSample:SAMN02744833
db_source MU110518.1
Definition hypothetical protein N665_4968s0001 [Sinapis alba]
Locus_tag N665_4968s0001

EGGNOG-MAPPER Annotation

COG_category GMO
Description Belongs to the glycosyl hydrolase 31 family
KEGG_TC -
KEGG_Module -
KEGG_Reaction R00028        [VIEW IN KEGG]
R00801        [VIEW IN KEGG]
R00802        [VIEW IN KEGG]
R06087        [VIEW IN KEGG]
R06088        [VIEW IN KEGG]
KEGG_rclass RC00028        [VIEW IN KEGG]
RC00049        [VIEW IN KEGG]
RC00077        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K01187        [VIEW IN KEGG]
EC 3.2.1.20        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00052        [VIEW IN KEGG]
ko00500        [VIEW IN KEGG]
ko01100        [VIEW IN KEGG]
map00052        [VIEW IN KEGG]
map00500        [VIEW IN KEGG]
map01100        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGACAGTTAGTGGAGATAACTCGGAAACTATAGGTATTACTCCAACCCAAATGATATTTGAACCGATCTTGGAGCATGGAGTTTTCAGGTTTGACTGTTCCGTGGAGCATAAGAAAGCTGCATTTCCAAGTGTTTCATTTAAAAACATCAAAGACAGGGAGGTTCCGGTTACAAGTCACAATGTTCCAGCATATACTCCTACTTGTGTTTGTCTTGAGGAGAAGCAAGTTGTCACTTTTGAGTTTTCTCCTGGCACATCGTTTTATGGGACGGGAGAAGTTGGTGGCGAGTTGGAGAGAACAGGAAAACGGGTGTTTACATGGAACACTGATGCATGGGGATATGGACCTGGAACCACTCCACTGTACCAGTCACATCCATGGGTGCTAGTTGTTCTTCCAACTGGAGAAACACTTGGTGTTCTTGCTGATACAACACAAAAGTGTGAGATTGATCTGAGGAAAGAAGGGATCATAAGAATAATCGCTCCAACAGCATATCCAATTATCACATTTGGTCCATTCTCTTCACCCACTGATGTCTTGGAGTCCTTGTCACACGCCATAGGAACTGTTTTCATGCCTCCAAAGTGGGCCTTAGGCTACCACCAATGCCGTTTTAGTTATATGTCAGAGAAGCGAGTGGCTGAGATAGCTCAAACATTTCGTGACAAGAAGATTCCTGCAGACGTGATATGGATGGATATTGACTACATGGATGGTTTTCGTTGTTTCACATTCGACAAGGAACGTTTCCCGGATCCTAGTGCTTTAGCAAACTATCTTCATAATAATGGTTTCAAAGCGGTATGGATGCTTGACCCAGGAATTAAGCAAGAGGAAGGTTATTTTGTCTATGATAGTGGAAGCAAAGACGATGTTTGGGTTAAACAAGCAGATGGGAAGAAGCCCTTCATTGGTGAAGTATGGCCTGGACCTTGCGTTTTCCCTGACTATACAAACTCCAAAACTCGGTCTTGGTGGGCTAGTTTGGTTAAAGACTTTATATCAAATGGTGTTGATGGTATATGGAACGATATGAATGAGCCTGCTGTTTTCAACACCGTGACAAAGACAATGCCTGAGAACAATATTCACTGTGGAGACGATGAGCTTGGAGGTGTTCAAAACCACTCACACTACCACAACGTCTATGGGACTCTGATGGCAAGGTCCACTTATGAAGGGATTGAGCTAGCTGATCAGAATAAGCGACCTTTTGTATTGACAAGGGCTGGATTCATAGGGAGCCAGAGATATGCTGCTACTTGGACAGGAGATAACCTTTCAACCTGGGAGCATCTACATATGTCTATATCCATGGTACTTCAGCTGGGGCTTAGTGGCCAGCCCTTATCAGGGCCAGATATAGGTGGCTTTGGTGGCAATGCAACTCCGAGGCTATTTGGACGTTGGATGGGCATTGGAGCTATGTTTCCATTCTGTCGTGGCCATTCTGAGGTTGCCACTGATGACCATGAGCCATGGTCATTCGGAGAAGAGTGTGAGGAAGTCTGTCGTGCTGCATTGAAGAGGCGCTACCAACTGTTACCACATTTTTACACCCTATTTTACATTGCCCATACAACTGGTGCTCCAGTTGCAGCTCCAATCTTTTTCGCTGATCCGAAAGATTCTAGACTAAGGACTGTTGAAAACGCATTTCTTTTGGGTCCACTTCTCATATATGCAAGCACTTTAAATAATCAGGGATCGCATGAGCTGCACCACATGTTGCCCAAAGGAACTTGGTTGAGGTTTGATTTTGAAGATTCACATCCGGATTTACCGACATTATATTTACGAGGTGGATCCATTATATCAATGGCTCCTCCACATCTACATGTTGGGGAGTTTACTCTGTCAGATGACTTGACTCTACTTGTGTCATTAGATGAAAATGGCAAAGCTGAAGGCCTCTTGTTTGAGGATGATGGAGATGGATATGGCTACACTAAAGGAAGATTTCTAATCACAAACTACATTGCTGAGAAGCATTCGTCCATTGTTACTGTTAAGGTTTCAAAAACTGAAGGAGATTGGCAGAGGCCAAAGCGCCGTGTTCATGTCCGGCTATTACTAGGAGGCGGTGCAATGCTTGATGCTTGGGGAATGGATGGAGAGACTATCCAGATTATTGTGCCTTCAGAAAGTGAAGTTTCAGAATTAATAACCACCAGCAATGAGCGCTTCAAACTTCATATGGAAAATACAAAACTGATACCTGAGAAGGAAATTCTACCTGGAGAAAAGGGAATGGAACTATCAAGATTGCCAGTTGAGCTAAACAATGGCAATTGGAAACTAAACATAATTCCTTGGATTGGTGGAAGGATATTGTCCATGTCACATGTTCCATCAGGAGTACAATGGCTTCATAGCAGGATAGATATCAATGGTTATGAAGAGTACAGCAGTACTGAGTACCGGTCATCTGGATGTACTGAGGAATACAATATCATTGAGACGGATTTAGAACATGCAAGAGAGGAAAAATCACTTATCCTAGAAGGTGATGTAGGTGGTGGACTTGTTCTTCAGCGTAAAATTACCATACCTAATGACAATCCCAACATTCTTCGAATTGCCTCTATCATTGAAGCCCGTAGTGTTGGTGCTGGTTCTGGTGGATTTTCAAGGCTGGCATGCTTAAGAGTCCATCCAACTTTCACTATCTTTCATCCAACCAAATCGTTTGTCTCATTCACATCGATTGATGGTACAAATCATGAAGTCTCGCCAGATTCTGGAGAGAAATTATATGAAGGGAACAACCTCCCACATGGTACATACTTGTATACTCGTCATATTGTTCTCATGTTTTAG
Protein:  
MTVSGDNSETIGITPTQMIFEPILEHGVFRFDCSVEHKKAAFPSVSFKNIKDREVPVTSHNVPAYTPTCVCLEEKQVVTFEFSPGTSFYGTGEVGGELERTGKRVFTWNTDAWGYGPGTTPLYQSHPWVLVVLPTGETLGVLADTTQKCEIDLRKEGIIRIIAPTAYPIITFGPFSSPTDVLESLSHAIGTVFMPPKWALGYHQCRFSYMSEKRVAEIAQTFRDKKIPADVIWMDIDYMDGFRCFTFDKERFPDPSALANYLHNNGFKAVWMLDPGIKQEEGYFVYDSGSKDDVWVKQADGKKPFIGEVWPGPCVFPDYTNSKTRSWWASLVKDFISNGVDGIWNDMNEPAVFNTVTKTMPENNIHCGDDELGGVQNHSHYHNVYGTLMARSTYEGIELADQNKRPFVLTRAGFIGSQRYAATWTGDNLSTWEHLHMSISMVLQLGLSGQPLSGPDIGGFGGNATPRLFGRWMGIGAMFPFCRGHSEVATDDHEPWSFGEECEEVCRAALKRRYQLLPHFYTLFYIAHTTGAPVAAPIFFADPKDSRLRTVENAFLLGPLLIYASTLNNQGSHELHHMLPKGTWLRFDFEDSHPDLPTLYLRGGSIISMAPPHLHVGEFTLSDDLTLLVSLDENGKAEGLLFEDDGDGYGYTKGRFLITNYIAEKHSSIVTVKVSKTEGDWQRPKRRVHVRLLLGGGAMLDAWGMDGETIQIIVPSESEVSELITTSNERFKLHMENTKLIPEKEILPGEKGMELSRLPVELNNGNWKLNIIPWIGGRILSMSHVPSGVQWLHSRIDINGYEEYSSTEYRSSGCTEEYNIIETDLEHAREEKSLILEGDVGGGLVLQRKITIPNDNPNILRIASIIEARSVGAGSGGFSRLACLRVHPTFTIFHPTKSFVSFTSIDGTNHEVSPDSGEKLYEGNNLPHGTYLYTRHIVLMF